An Extensive Empirical Comparison of <i>k</i>-means Initialization Algorithms

نویسندگان

چکیده

The k-means clustering algorithm, whilst widely popular, is not without its drawbacks. In this paper, we focus on the sensitivity of to initial set centroids. Since cluster recovery performance can be improved by better initialisation, numerous algorithms have been proposed aiming at producing good However, it still unclear which algorithm should used in any particular scenario. With mind, compare 17 such 6,000 synthetic and 28 real-world data sets. sets were produced under different configurations, allowing us show excels each Hence, results our experiments particularly useful for those considering a non-trivial

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Procedure for Empirical Initialization of Adaptive Testing Algorithms

In constrained adaptive testing, the numbers of constraints needed to control the content of the tests can easily run into the hundreds. Proper initialization of the algorithm becomes a requirement because the presence of large numbers of constraints slows down the convergence of the ability estimator. In this paper, an empirical initialization of the algorithm is proposed based on the statisti...

متن کامل

An Empirical Study of K-Means Initialization Methods for Document Clustering

Everyday vast amounts of documents, e-mails, and web pages are generated. In order to handle these data, automatic techniques such as document clustering are needed. The k-means method is a clustering technique widely used in practice because of its simplicity and empirical speed. In this paper, the basic k-means algorithm is augmented with two special initialization techniques that aim at impr...

متن کامل

A Performance Comparison of Event Calendar Algorithms: an Empirical Approach

We propose a suite of tests based on two-state Markov chains for experimentally assessing the dynamic performance of a variety of simulation event calendar implementations. In contrast to previous studies based on the standard hold model for evaluation of performance statically, the proposed Markov hold model is more general and can be used to examine how different implementations respond dynam...

متن کامل

An Empirical Comparison of Any-Angle Path-Planning Algorithms

We compare five any-angle path-planning algorithms, Theta*, Block A*, Field D*, ANYA, and Any-Angle Subgoal Graphs in terms of solution quality and runtime. Anyangle path-planning is a fairly new research area, and no direct comparison exists between these algorithms. We implement each algorithm from scratch and use similar implementations to provide a fair comparison. Introduction For path pla...

متن کامل

An Empirical Comparison of Algorithms for Aggregating Expert Predictions

Predicting the outcomes of future events is a challenging problem for which a variety of solution methods have been explored and attempted. We present an empirical comparison of a variety of online and offline adaptive algorithms for aggregating experts’ predictions of the outcomes of five years of US National Football League games (1319 games) using expert probability elicitations obtained fro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2022

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2022.3179803